Dialogue Segmentation with Large Numbers of Volunteer Internet Annotators

Author

T. Daniel Midgley

Abstract

This paper shows the results of an experiment in dialogue segmentation. In this experiment, segmentation was done on a level of analysis similar to adjacency pairs. The method of annotation was somewhat novel: volunteers were invited to participate over the Web, and their responses were aggregated using a simple voting method. Though volunteers received a minimum of training, the aggregated responses of the group showed very high agreement with expert opinion. The group, as a unit, performed at the top of the list of annotators, and in many cases performed as well as or better than the best annotator.
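The abstract says only that volunteer responses were aggregated with "a simple voting method" and gives no further detail. As a minimal sketch, assuming a plain majority vote over boundary positions, the aggregation could look like the following. The function name `aggregate_boundaries`, the set-based input format, and the 0.5 threshold are illustrative assumptions, not details from the paper.

```python
from collections import Counter


def aggregate_boundaries(annotations, num_utterances, threshold=0.5):
    """Return boundary positions marked by more than `threshold` of annotators.

    annotations: list of sets; each set holds the utterance indices after
    which one annotator placed a segment boundary (hypothetical format).
    """
    if not annotations:
        return set()
    votes = Counter()
    for marked in annotations:
        # Each annotator contributes at most one vote per position.
        votes.update(i for i in marked if 0 <= i < num_utterances)
    needed = threshold * len(annotations)
    return {i for i, count in votes.items() if count > needed}


# Five hypothetical volunteers segmenting a ten-utterance dialogue.
volunteers = [{2, 5}, {2, 6}, {2, 5, 8}, {5}, {2}]
consensus = aggregate_boundaries(volunteers, num_utterances=10)
print(sorted(consensus))
```

Here positions 2 and 5 each receive more than half of the five votes, so only they survive as group boundaries; the stray marks at 6 and 8 are discarded. Raising `threshold` would make the group more conservative.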

Similar resources

Meaning Unit Segmentation in English and Chinese: a New Approach to Discourse Phenomena

We present a new approach to dialogue processing in terms of “meaning units”. In our annotation task, we asked speakers of English and Chinese to mark boundaries where they could construct the maximal concept using minimal words. We compared English data across genres (news, literature, and policy). We analyzed the agreement for annotators using a state-of-the-art segmentation similarity algorit...

Counting in the Wild

In this paper we explore the scenario of learning to count multiple instances of objects from images that have been dot-annotated through crowdsourcing. Specifically, we work with a large and challenging image dataset of penguins in the wild, for which tens of thousands of volunteer annotators have placed dots on instances of penguins in tens of thousands of images. The dataset, introduced and ...

ACT: a graphical dialogue annotation comparison tool

Although there are a number of tools for annotating dialogue, little work has been done in visualizing differences among annotations. Visualization of differences in annotation should help in doing consensus annotation, refining an annotation scheme and training novice annotators. In this paper, we present a graphical Annotation Comparison Tool (ACT), which displays multiple annotations side by...

Automatic turn segmentation for Movie & TV subtitles

Movie and TV subtitles contain large amounts of conversational material, but lack an explicit turn structure. This paper presents a data-driven approach to the segmentation of subtitles into dialogue turns. Training data is first extracted by aligning subtitles with transcripts in order to obtain speaker labels. This data is then used to build a classifier whose task is to determine whether two ...

Evaluating Dialogue Act Tagging with Naive and Expert Annotators

In this paper the dialogue act annotations of naive and expert annotators, both annotating the same data, are compared in order to characterise the insights that annotations made by different kinds of annotators may provide for evaluating dialogue act tagsets. It is argued that the agreement among naive annotators provides insight into the clarity of the tagset, whereas agreement among expert annotators...

Publication date: 2009
